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Abstract 


One  may  estimate  a  conditional  hazard  function  from  grouped  (and  possibly  censored) 
survival  data  by  the  time  and  covariate  specific  occurrence/exposure  rate.  Asymptotic 
results  for  cumulative  versions  of  this  estimator  are  developed,  utilizing  the  general  frame¬ 
work  of  counting  processes.  In  particular,  a  grouped  data  based  goodness-of-fit  test  for 
Cox’s  proportional  hazard  model  is  given.  Various  constraints  on  the  asymptotic  behavior 
of  the  widths  of  the  calendar  periods  and  covaria.te  strata  employed  in  grouping  the  data 
are  needed  to  prove  the  results.  Actual  performance  of  the  estimators  and  test  statistics  is 
evaluated  by  Monte  Carlo  methods. 


1.  Introduction 

The  purpose  of  this  paper  is  to  study  grouped  data  based  inference  for  Beran’s 
(1981)  general  nonparametric  hazard  function  model  and  Cox’s  (1972)  proportional 
hazards  model.  In  the  general  nonparametric  model  the  conditional  hazard  function 
A(i|z)  of  the  survival  time  of  an  individual  with  covariate  z  is  given  by 

A(<|z)  =  a{t,z),  (1.1) 

where  a  is  an  unknown  function.  In  Cox’s  model,  a  is  specified  by 

o(t,2)  =  A0(*)exp{/3iz},  (1.2) 

where  the  covariate  is  p-dimensional,  0q  is  a  p- vector  of  unknown  regression  coeffi¬ 
cients  and  Ao  is  an  unknown  baseline  hazard  function. 

The  usual  approach  to  analyzing  grouped  survival  data  is  to  assume  that  a  is 
piecewise  constant  over  each  grouping  cell,  see  Hoem  (1987).  Then  the  likelihood 
function  is  proportional  to  the  Poisson  likelihood  (Laird  and  Olivier,  1981)  and 
the  maximum  likelihood  estimator  of  a  is  the  classical  occurrence/ exposure  rate. 
Kalbfleisch  and  Prentice  (1973).  Holford  (1976),  and  Prentice  and  Gloeckler  (1978) 
have  studied  the  maximum  likelihood  estimator  of  the  regression  parameters  in 
Cox’s  model  when  the  baseline  hazard  function  is  taken  as  a  step  function.  Holford 
(1976)  noted  that  this  estimator  is  inconsistent  unless  the  grouping  becomes  finer 
as  the  sample  size  increases. 

It  is  important  to  know  whether  the  convenience  of  analyzing  grouped  data 
from  a  given  actuarial  life  table  is  overshadowed  by  biases  that  arise  when  the 
grouping  is  coarse.  There  exist  many  numerical  studies  comparing  the  grouped  and 
continuous  Cox  model  analyses  for  specific  data  sets,  see  the  references  in  Hoem 
(1987,  p.  137).  All  these  studies  have  found  that  the  two  approaches  give  quite  sim¬ 
ilar  results.  Breslow  (1986),  considering  data  on  cancer  mortality  among  Montana 
smelter  workers,  found  that  the  estimated  regression  coefficients  from  the  grouped 
data  analysis  were  within  one  standard  error  of  those  from  the  continuous  data 
analysis.  Similar  conclusions  were  reached  by  Selmer  (1990)  for  data  on  mortality 
from  coronary  heart  disease.  Selmer  obtained  an  extremely  close  agreement  of  like¬ 
lihood  ratio  test  statistics  (used  to  test  for  differences  between  nested  models)  in 
the  full  Cox  model  and  the  Cox  model  with  a  piecewise  constant  baseline  hazard 
function.  However,  it  would  be  useful  to  have  a  theoretical  underpinning  for  these 
empirical  studies. 

Theoretical  results  for  continuous  data  are  well  developed;  see  Andersen  and 
Gill  (1982)  (henceforth  AG)  for  Cox’s  model,  and  McKeague  and  Utikal  (1990a) 
(henceforth  MU)  for  the  general  nonparametric  model.  Corresponding  results  for 
grouped  data  are  available  only  in  special  cases.  Aficionados  of  contingency  table 
analysis  might  consult  Friedman’s  (1982)  paper  on  the  Cox  model,  but  his  results 
are  difficult  to  interpret  in  the  survival  analysis  context.  Pons  and  Turckheim 
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(1987)  have  used  histogram  sieve  estimators  for  Cox’s  model  (as  have  Borgan  and 
Ramlau-Hansen  (1985)  and  Karr  (1987)  for  Aalen’s  multiplicative  intensity  model), 
but  their  approach  applies  to  grouped  data  only  when  the  co variate  takes  at  most 
finitely  many  values  and  is  non-time  dependent.  As  far  as  we  know,  the  general 
nonparametric  model  with  grouped  data  has  not  been  treated  in  the  literature. 

Our  aim  here  is  to  show  that  the  results  of  AG  and  MU  have  analogues  in  the 
grouped  data  case.  Various  conditions  on  the  asymptotic  behavior  of  the  widths  of 
the  calendar  periods  and  covariate  strata  used  in  grouping  the  data  are  needed  for 
this.  We  also  develop  a  grouped  data  version  of  MU’s  (1991)  goodness-of-fit  test  for 
the  Co:  model.  Estimators  of  cumulative  hazard  function  A(m,z)  = 

J0  a(t,z)d$  and  the  doubly  cumulative  hazard  function  *4(v)  =  f0fQadtdx  play 
an  important  role  in  this  work.  The  goodness-of-fit  test  is  based  on  a  comparison 
of  estimates  of  A  under  the  general  nonparametric  model  and  the  Cox  model. 

In  Section  2  we  formulate  the  general  model  (1.1)  in  the  (by  now  standard) 
counting  process  setting  and  discuss  the  estimation  of  A(-,z)  and  A.  Our  Cox 
model  results,  extending  AG,  are  given  in  Section  3.  The  goodness-of-fit  test  for 
the  Cox  model  is  discussed  in  Section  4.  Section  5  contains  a  simulation  study  and 
an  application  to  real  data.  All  proofs  are  contained  in  Section  6.  For  simplicity 
we  restrict  attention  to  the  case  of  a  one-dimensional  covariate  (p  =  1)  throughout 
the  paper. 

2.  Fitting  the  general  nonparametric  model  to  grouped  data 

Let  N (t)  —  (Ari(t), . . . ,  Nn(t)Y,  t  €  [0,1]  be  a  multivariate  counting  process 
with  respect  to  a  right  continuous  filtration  T\n\  where  Ni(t)  is  the  number  of 
failures  of  the  zth  individual  during  the  time  period  [0,t].  The  counting  process  N 
is  adapted  to  the  filtration  and  the  sample  paths  of  N\ , . . . ,  Nn  are  right-continuous 
step  functions,  zero  at  time  zero,  with  jumps  of  size  +1.  No  two  component  pro¬ 
cesses  jump  simultaneously.  We  also  assume  that  TV,-  has  intensity 

MO  =  *(*)*(*■  3, ■(*)),  (2.1) 

where  a  is  a  completely  general  function,  Y{(t)  is  a  predictable  {0,  l}-valued  process 
indicating  that  the  zth  individual  is  at  risk  when  Yi(t)  =  1,  and  Zi(t)  is  a  predictable 
[0,  l]-valued  covariate  process.  The  processes  M,(f)  =  N{(t)  —  f*  A i(s)ds  are  local 
martingales. 

Let  the  cells  into  which  the  data  are  grouped  be  denoted  Crj  =  Tr  x  Tj,  where 
T\ , . . . ,  Tin  and  T\ , . . .  ,Xyn  are  the  respective  calendar  periods  (time  intervals)  and 
covariate  strata.  For  simplicity,  the  time  intervals  are  taken  to  be  of  equal  length 
ln  =  1  /Ln  and  the  covariate  strata  are  taken  to  have  equal  width  wn  =  1  j Jn. 
Grouped  data  consist  of  the  total  number  of  failures  and  the  total  time  at  risk 
(exposure)  in  each  cell  Crj,  given  by 

=YjT  ^ 2 ■•(*)  €  dN&)  and  Yrf  =  E  jT  €  W(0 
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respectively.  All  our  estimators  are  based  on  such  data. 

Let  ao  be  the  underlying  hazard  function  to  be  estimated.  To  carry  out  infer¬ 
ence  for  ao  we  need  to  assume  that  the  support  of  ao,  denoted  supp(ao),  is  known. 
This  assumption  is  needed  to  avoid  the  problem  of  low  exposure  on  the  boundary 
of  the  support.  In  typical  survival  analysis  applications  supp(ao)  can  be  assumed 
to  be  the  whole  of  [0,1]2,  but  there  are  simple  useful  examples  where  this  is  not 
the  case;  e.g.,  the  illness-death  process  with  duration  dependence  (see  Example  3 
of  MU)  in  which  supp(a0)  is  the  triangle  {(t,~)  £  [0,  l]2:  2  <  t). 

We  slightly  modify  the  usual  occurrence/exposure  rate  around  the  boundary  of 
supp(ao)  by  setting  it  equal  to  zero  when  any  part  of  a  cell  falls  outside  supp(ao), 
i.e.,  define 

jV<n) 

a(t,  «)  =  ~^y  for  (<,  *)  €  Crj  C  supp(a0), 

^  rj 

zero  otherwise.  Our  estimators  are  defined  by 


A(*,z)=  /  a(s,z)ds  and  ^[(♦,*)=  /  /  a(s,x)dsdx. 

Jo  Jo  Jo 

We  assume  throughout  that  ao  is  Lipschitz  on  its  support.  The  following  minor 
abuse  of  our  notation  will  be  very  convenient:  for  any  process  (or  set)  £rj  indexed 
by  integers  r  and  j,  define  £tz  for  (t,  z)  £  [0,  l]2  by  £tz  =  £rj  for  (t,z)  £  Crj.  Define 

V=  {(f,z):  Ctz  C  supp(a0)}, 

let  Vz  denote  the  z-section  of  X>,  and  set  Y(n\t,z)  =  £  lz}Y{(t). 

Condition  A 

(Al)  There  exists  a  nonnegative,  Lipschitz  function  /(•,•),  which  is  bounded 
away  from  zero  on  supp(ao),  such  that 


Y(n\t,z) 

riwn 


/(*>*))  dt  =  o(l2n). 


(A2)  Leb{(  €  Vz:YM(t,  z)  =  0}  =  oP(ln). 

/  *  o\  T^(  Tiwn  \ 4 

(A3)  sup(,i06i>,g(>yf")^-)J  <oo. 

(A4)  Leb(XL)  =  0(ien)  +  0(/n),  where  Vz  is  the  set  of  times  t  for  which  Ctz 
overlaps  both  supp(ao)  and  its  complement. 


These  conditions  are  slightly  stronger  than  those  required  in  MU  (1990a),  but 
they  are  still  quite  mild.  In  particular,  Condition  (A4)  is  satisfied  for  the  illness- 
death  model  mentioned  above  since  Leb(Vz)  <  2ln  -f  wn  —  0(ln)  -f  0(ivu)  in  that 
example. 
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THEOREM  2.1  For  a  fixed  z  €  [0,1]  suppose  that  Condition  A  holds.  If  nw\  — ►  0, 
nwnl 2n  — >  0  and  nwn  —>  oo,  then  under  a  =  ao 


U{;z) 


in  Z)[0,1],  where  U(',z)  is  a  continuous  Gaussian  martingale  with  zero  mean  and 
variance  function  * 

Var(U(t,z ))  =  /  h(u,z)du, 

Jo 


where  h  —  ao//. 


In  order  to  derive  the  asymptotic  distribution  of  A  we  need  to  consider  a  se¬ 
quence  of  models  of  the  form  (2.1),  with  each  indexed  by  the  sample  size  and  having 
a  piecewise  constant  over  the  cells  used  to  group  the  data,  cf.  McKeague  (1988). 
At  a  given  sample  size  a  is  assumed  to  be  the  piecewise  constant  approximation  d'0 
to  Q'o  determined  by  the  cells  Crj.  The  approximation  do  is  defined  by 


ao  (t,z)  = 


ao(u,x)dudx  if  Ctz  C  supp(a0), 


zero  otherwise.  The  following  theorem  is  an  extension  to  the  grouped  data  setting 
of  MU’s  (1990b)  asymptotic  normality  result  for  A. 


Condition  B 

(Bl)  ff  -  f(t,z)Y  dtdz  =  o(l2nwl). 

JJV  \  TlWn  ' 

(B2)  Leb2{(t,;)€D:F(n>(t,2)=0}  =  oP(lnwn). 

(B3)  suP((iI)€I,£(^^)4<oo. 

THEOREM  2.2  Suppose  that  Condition  B  holds.  If  ln  — *  0,  wn  — >  0,  =  0(1), 

and  Jn  =  O(n),  then  the  distribution  of  y/n(A  —  A)  under  a  =  d0  converges  in  D2 
to  the  distribution  of  the  process 


m(t,z)=  f  f  h(u,x)dW(u,x ), 

Jo  Jo 


where  h  =  ao//  in  the  interior  of  supp(ao),  zero  otherwise,  and  W  is  a  Brownian 
sheet. 


In  the  following  proposition  we  check  that  Conditions  A  and  B  are  satisfied  in 
the  i.i.d.  case.  Let  (Ni,  Yj,  Zj),  i  =  1, ...  ,n  be  i.i.d.  copies  of  (Ar,  Y,  Z).  Let  F(t,') 


be  the  subdistribution  function  of  the  covariate  process  at  time  t  when  Y(t)  =  1, 
i.e.,  F(tyx)  =  P(Z(t)  <  £,y(<)  =  1),  — oo  <  x  <  oo. 


PROPOSITION  2.1  (i.i.d.  case).  Suppose  that  for  each  t  €  [0, 1],  F(f,  •)  is  absolutely 
continuous  on  the  support  of  ao(f,  *)  in  [0, 1]  with  density  /(f,  •)  such  that  /(•,  •)  is 
Lipschitz  and  bounded  away  from  zero  on  the  support  of  a o-  Suppose  that  ln  — ►  0 
and  nwzn  =  0(1). 

(i)  If  nwnln  — ►  oo,  then  Conditions  (A1)-(A3)  hold, 
oo  If  nw^ln  — *  oo,  then  Condition  B  holds. 

When  using  A  in  the  i.i.d.  case  it  suffices  that  the  interval  widths  wn  and  ln 
satisfy  nw„  — ►  0,  nw £  — +  oo,  and  ln  ~  wn.  When  using  A  in  the  i.i.d.  case  it  suffices 
that  the  interval  widths  satisfy  =  0(1),  nw^J2  — *■  oo,  and  ln  y/w^-  In 
particular,  this  suggests  that  for  A  there  should  be  (asymptotically)  more  covariate 
strata  than  time  intervals.  Whether  such  advice  should  be  followed  in  practice  will 
be  considered  in  the  simulation  section. 

It  is  possible  to  give  a  version  of  our  result  for  A  under  the  model  a  =  cto, 
but  we  would  then  need  nw%  — ►  0,  which  conflicts  with  the  rate  in  Proposition  2.1 
(ii),  so  the  result  would  not  be  useful  in  the  i.i.d.  case.  We  are  able  to  get  around 
this  difficulty  by  restricting  attention  to  the  sequence  of  piecewise  constant  models 
a  =  do.  We  regard  this  as  a  very  natural  approach  in  the  grouped  data  setting. 

3.  Fitting  the  Cox  model  to  grouped  data 

In  the  continuous  data  case  the  regression  coefficient  /?o  is  estimated  by  maxi¬ 
mizing  Cox’s  partial  likelihood  function  which  has  logarithm 

C(0)  =  E/1  PZ-(u)dN,(u)  -  £  log  dN^(u), 

where  =  Yli  Pons  and  Turckheim  (1987)  estimate  /?o  by  maximizing  a 
histogram-type  Cox’s  partial  likelihood  function  which  has  logarithm 

C0P)  =  Y,YljT  ^u)<W,-(iO  +  £log(£jf  eMWYiWdu)  ' jf  dJV<n>(«). 

However,  in  the  grouped  data  case  neither  C(/3)  nor  Ch(fi)  is  observable.  In  fact 
Ch{@)  is  observable  with  grouped  data  only  when  the  covariate  process  Z  takes  at 
most  finitely  many  values  and  is  non-time  dependent. 

For  the  general  grouped  data  case  we  need  to  consider 

cm  -■  £  wl?  -  £  log  ( £  )  w<»>  , 

r,j  r  j 
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where  Nr ArW  1S  number  of  failures  in  the  rth  calendar  period,  and  Zj 

is  a  representative  covariate  value  for  the  j th  stratum.  The  estimator  P  is  defined 
as  a  solution  to  Ug(/3)  =  0,  where  Ug  is  the  derivative  of  Cg.  In  the  piecewise 
exponential  model  of  Holford  (1976),  p  is  the  full  maximum  likelihood  estimator  of 


Po,  and 


for  t  e  Tr , 


is  the  maximum  likelihood  estimator  of  the  baseline  hazard  function.  We  also  need 
the  grouped  data  based  analogue  of  Breslow’s  estimator  of  the  cumulative  baseline 
hazard  function  Ao (t)  =  f0  Aq (u)du  given  by 


As  in  AG,  denote 


AW  =  f 

Jo 


Ao(w)  du. 


S(fc)(/M)  =  -Y  Zht)Y,(t)e0z‘W 

t 

for  k  =  0, 1, 2,  where  0°  =  1.  The  following  conditions  are  assumed  to  hold  through¬ 
out  this  section. 


Condition  C 

(Cl)  (Asymptotic  stability).  There  exists  a  neighborhood  B  of  Pq  and  function 
defined  on  B  x  [0, 1]  such  that 


sup  |S(*>OM)-sWG9,i)lJ-*0  for*  =  0,1,2,  (Cl.l) 

s\ipE(S(-k\Po,t)-s<'k\l3o,t))2  =  O(-)  fort  =  0,1.  (Cl. 2) 

t  \n/ 


(C2)  (Asymptotic  regularity).  Let  be  defined  as  in  Condition  A  and 

define  v  =  s^/s^  —  (s^ / s(0))2 .  For  all  p  Q  B,  t  €  [0, 1]: 

*(,)(/M)  =  ^(0)(/M),  sW(P,t)  = 

s^(-,  •).  k  =  0, 1,2,  are  continuous  functions  on  B  x  [0,  l],  is  bounded 
away  from  zero  on  B  x  [0,1],  and 

v(A>,<)s(0)(/?o,<)M<)* 


6 


is  positive. 

We  shall  also  assume  that  Ao  is  Lipschitz.  Our  conditions  are  slightly  stronger 
than  the  corresponding  conditions  of  AG:  we  assume  a  rate  of  convergence  in  Cl. 2, 
and  that  is  continuous.  Our  first  result,  showing  consistency  of  /?,  does  not 
need  either  the  full  strength  of  Condition  Cl  (just  Cl.l)  or  the  Lipschitz  condition. 
Condition  C  can  be  checked  in  the  i.i.d.  case,  with  Z  and  Y  having  sample  paths 
in  Skorohod  space  D[0, 1],  by  using  similar  arguments  to  Theorem  4.1  of  AG. 

THEOREM  3.1  (Consistency  of  ft).  If  wn  — ►  0  and  ln  —+  0,  then 

P~^Po- 


Theorem  3.2 
then 


(Asymptotic  normality  of  P).  If  nw 2n  — ►  0,  nl \ 


Mi j-w-ms'1)' 


0  and  nln  — »  oo, 


It  follows  from  the  proof  of  Theorem  3.2  that  if  wn  — ►  0  and  ln  — *  0,  then 
n~1  Ig{P)  is  a  consistent  estimator  of  E,  where  Ig(P)  is  minus  the  second  derivative 
of  Cg(P).  The  final  result  in  this  section  gives  the  limiting  distribution  of  A  (cf. 
AG’s  Theorem  3.4). 

Theorem  3.3  If  nw -  — *  0,  ni\  — ►  0  and  nln  — >  oo,  then 

Vn(A  -  Ao)-^mo(-)  +  m^l)^-)  in  D[ 0, 1], 


where 


['  *(1)(A> ,«) 
Jo  sW(/3o,u) 


A0(u) du, 


and  mo  and  mj  are  independent  zero  mean  Gaussian  martingales  with 


Mu)  j 
s<o>(0o,u) 


v(Po,u)s('°\Po,u)\0{u)  du. 


4.  Goodness-of-fit  test  for  the  Cox  model 

In  this  section  we  consider  testing  whether  an  underlying  Cox  model  a0(f ,  c)  = 
Ao(/)  exp{/?o^}  adequately  fits  the  grouped  data.  Here  the  support  of  Ao  is  the  whole 
unit  interval.  Once  again  we  need  to  consider  a  sequence  of  models  a  =  qq  implicitly 
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indexed  by  the  sample  size,  where  Qo  is  the  piecewise  constant  approximation  to  n0 
given  by 


ot0(t,z)  —  A0 (t)e^0Zj  for  r  €  lj , 


(4.1) 


and  Zj  is  a  certain  covariate  value  in  the  j th  stratum.  Under  the  Cox  model  the 
doubly  cumulative  hazard  function  is  estimated  by 


A(t,z)  =  /  /  a(u,x)dudx, 

Jo  Jo 


where 

q(u,  z)  =  \o(u)e^:j  for  2  G  Ir 

The  following  result  can  be  used  to  construct  a  chi-squared  goodness-of-fit  test 
of  the  Cox  model  versus  the  alternative  that  the  Cox  model  does  not  hold,  btised 
on  increments  of  A  —  A.  cf.  MU  (1991). 

THEOREM  4.1  Suppose  that  Conditions  B  and  C  hold.  If  wn  — ►  0,  /„  — ►  0.  nu^l^  = 
0(1),  Jn  =  O(n)  and  nln  — ►  00,  then  the  distribution  of  >/ri(A  —  A)  under  the 
piecewise  constant  Cox  model  a  =  Qo  converges  in  D2  to  the  distribution  of  the 
process 


m{t,z)  = 


where 


n  Vn^)dw{u,x)-b{z)  [  [  yjj&±\dw{u.x) 

Jo  Jo  Jo  Jo  s{O){0o,u) 


h(u,x) - 


/(«>*) 

g(u,x)  =  A  O(u)e0ozf(u,x), 
b(:)=  [  c0°xdx, 

Jo 

c(<,2)  =  S-,{a0(<)  f  xe^dx-b(z) 

L  Jo  Jo  s^)(/30,u)  j 


When  using  A  —  A  in  the  i.i.d.  case  it  suffices  that  wn  and  satisfy  the  same 
rates  as  previously  given  for  A.  Note  that  the  rate  nu’J  — ►  0  needed  for  two  of  our 
Cox  model  results  (in  Section  3)  is  incompatible  with  the  slower  rate  needed  when 
using  A  in  the  i.i.d.  case  (see  Pioposition  2.1  (ii)).  For  this  reason  the  condition 


S 


riWn  — >  0  was  avoided  in  Theorem  4.1,  Fortunately  our  Cox  model  results  hold 
without  this  condition,  provided  a  = 

In  order  to  perform  the  chi-squared  test  we  need  to  estimate  the  functions  /i,  <7, 
b  and  c.  This  is  done  by  inserting  /?,  Ao  and  estimators  of  u)  for  k  —  0, 1  and 

/(a,  re).  Estimate  s^k\ftoyu)  by  Sgk\fttu)  defined  after  the  proof  of  Proposition  2.1 
in  Section  6,  and  estimate  /(u, re)  by  jnwnln  for  u  £  Tr  and  re  €  Xj.  Further 
details  on  the  construction  of  the  chi-squared  test  can  be  found  in  MU  (1991). 

5.  Numerical  results 

5.1  Monte  Carlo  study.  We  have  carried  out  a  simulation  study  of  the  per¬ 
formance  of  our  Cox  model  estimators  and  test  statistics  for  various  sample  sizes, 
censoring  levels,  and  grouping  patterns.  The  parameters  of  the  underlying  Cox 
model  were  taken  to  be  fto  =  1  and  Ao  =  1,  and  the  covariate  was  uniformly  dis¬ 
tributed  on  [0,1].  The  censoring  time  was  independent  of  both  the  failure  time 
and  the  covariate,  and  exponentially  distributed  with  parameter  7.  The  censoring 
parameter  7  was  set  to  0.75  and  2,5,  amounting  to  31%  and  60%  censoring  prior 
to  the  end  of  follow-up.  In  each  case  the  follow-up  period  was  adjusted  to  give  an 
average  of  19%  surviving  beyond  the  end  of  follow-up. 

[Insert  Table  1  here] 

Observed  coverage  probabilities  of  asymptotic  95%  confidence  intervals  for 
fto  are  contained  in  Table  1.  Inspecting  Table  1  we  find  that  all  the  coverage 
probabilities  are  close  to  their  nominal  value  of  ,95.  It  appears  that  variations  in 
sample  size  and  number  of  cells  have  little  effect.  Also,  we  could  find  no  evidence 
of  bias  in  ft.  In  view  of  Holford’s  (1976)  comment  that  ft  is  inconsistent  unless  the 
grouping  becomes  finer  as  the  sample  size  increases,  we  had  expected  to  eventually 
obtain  poor  results  at  very  large  sample  sizes  if  the  number  of  cells  is  kept  small. 
However,  this  effect  only  became  noticeable  for  sample  sizes  above  two  million. 

In  order  check  for  asymptotic  normality  of  ft  we  examined  normal  plots  and 
histograms  of  standardized  values  of  ft.  All  of  these  indicated  that  \fn{p  —  fto)YAi2 
closely  follows  a  standard  normal  distribution. 

We  performed  the  chi-squared  goodness-of-fit  test  of  Cox’s  model  using  incre¬ 
ments  o(  A  —  A  over  a  2  X  2  partition  of  the  grouping  cells,  so  that  there  were  4 
degrees  of  freedom.  The  results  are  displayed  in  Table  2.  Very  similar  results  were 
obtained  when  using  different  degrees  of  freedom,  e.g.  4x4.  Also,  our  results  are 
very  consistent  with  those  obtained  in  the  continuous  data  case,  see  MU  (1991). 

[Insert  Table  2  here] 

Inspection  of  Table  2  indicates  that  the  number  of  covariate  strata  Jn  has 
a  strong  influence  on  the  level  accuracy  of  the  test,  whereas  the  number  of  time 
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intervals  has  little  effect.  For  sample  sizes  less  than  1000  we  recommend  that  at 
most  5  covariate  strata  be  used.  For  sample  sizes  between  5000  and  10,000  we 
recommend  that  about  10  covariate  strata  be  used.  A  very  large  sample  size  (say 
n  >  40,  000)  would  be  required  to  obtain  satisfactory  results  when  using  20  covariate 
strata.  These  recommendations  hold  irrespective  of  the  amount  of  censoring. 

We  occasionally  (in  1-2%  of  cases)  obtained  a  negative  chi-squared  statistic 
at  sample  size  n  =  250  when  using  20  covariate  strata.  The  problem  of  a  negative 
chi-squared  statistic  can  be  caused  by  small  sample  size,  too  many  covariate  strata, 
low  rat^  of  survival  beyond  the  end  of  follow-up,  or  failure  of  the  data  to  fit  the 
Cox  model.  Under  the  Cox  model  the  problem  can  be  avoided  by  grouping  the 
data  so  that  the  total  time  at  risk  in  each  cell  is  sufficiently  large;  in  the  simulation 
study  we  found  that  it  was  enough  to  have  at  least  10%  surviving  beyond  the  end 
of  follow-up  in  each  covariate  stratum.  When  the  data  fail  to  fit  the  Cox  model 
the  use  of  a  model-robust  estimator  of  the  covariance  would  avoid  the  problem  (cf. 
Hjort.  1990.  p.  1254),  but  this  would  be  difficult  to  implement. 

5.2.  An  example  using  the  Japanese  atomic  bomb  survivors  data.  Huffer  and 
McKeague  (1991)  have  studied  the  application  of  Aalen’s  (19S0)  additive  risk  model 
to  grouped  data  on  the  incidence  of  cancer  mortality  among  Japanese  atomic  bomb 
survivors.  It  is  of  interest  to  examine  whether  these  data  can  be  adequately  fitted 
by  the  Cua  model.  The  time  variable  t,  taken  as  time  since  exposure,  is  grouped 
into  eight  4-years  intervals:  5-9,  . . .,  33-37  years.  The  covariate  is  dose  (in  units  of 
rads),  taken  as  the  midpoint  of  one  of  the  six  dose  groups:  0,  1-50,  50-100,  1 00— 
200,  200-300,  >  300,  with  dose=  400  for  dose  >  300.  According  to  our  simulation 
results,  this  grouping  of  the  data  is  adequate  for  the  chi-squared  test. 


Table  3.  Results  of  Cox  Model  Goodness-of-fit  Test  for  Japanese  Atomic  Bomb 
Survivor  Data. 


Age  at  exposure 


0-9 

10-19 

20-34 

35-49 

>  50 

Male 

i  Female  i 

reject 
negative  j 

reject 

reject 

negative 

negative 

reject 
negative  j 

reject 

reject 

We  have  evaluated  the  chi-squared  statistic  separately  for  males  and  females 
in  each  of  5  different  age  at  exposure  groups.  There  were  4  =  2x2  degrees  of 
freedom  in  each  test.  The  results  are  given  in  Table  3.  The  chi-squared  statistics 
indicate  extremely  strong  departures  from  the  Cox  model,  except  in  the  cases  having 
a  negative  chi-squared.  In  fact,  as  noted  above,  negative  chi-squared  values  also 
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suggest  a  lack  of  fit  with  the  Cox  model  when  the  total  time  at  risk  in  any  cell  is 
sufficiently  large  (as  is  the  case  with  these  data). 

6.  Proofs 


The  following  notation  will  be  useful: 


M??  =  Y,Jr  €  Xj)  dMi{u), 

M(n)(t,z)  =  ?i  I{Zi(u)  €lz}dMi(u), 

a(n)(t,z)  =  Y.nZiit)  e  z,}y<(0tt(<,z, •(*)), 

i 

=  f  a^n\u,  z)  du. 

Jrt 

Proof  of  Theorem  2.1  Note  that 

y/nwn(A(t,  z)—A(t,  z))  =  X*(t,z) 

+  V^J ;/“/{«€©,}  (-L - X - r)  MlnJ  du 

Jo  \y£?'  nwJnf(u,z)J 

ft  /  aW  \ 

-f  \/nujn  J  \^{u  eVz}^^  -  aQ(u,z)J  du, 


(6.1) 


where 


y/nwn  J 0  f(u,z) 


and,  given  a  function  0  defined  on  [0,1],  the  piecewise  linear  approximation  ip*  to 
ip  determined  by  the  calendar  periods  Tr  is  defined  by 


=  t/>(<r-l  )  +  ■■■  -J1  -  (V’(tr)  -  V»(*r-1 )) 

for  t  €  Tr  =  (tr_i,fr).  Using  an  analogous  argument  to  the  treatment  of  the  'erm 
Ar^n)  in  the  proof  of  Theorem  1  of  MU  (1990a),  it  can  be  shown  that  X ~^~*U  in 

D[ 0, 1].  Thus,  by  Lemma  4.1  of  McKeague  (1988),  we  have  X *—>U  in  D[0, 1].  The 
rest  of  the  proof  consists  in  showing  that  the  last  two  terms  on  the  r.h.s.  of  (6.1) 
converge  uniformly  in  probability  to  zero. 
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Note  that  the  predictable  variation  process  of  M»  is  =  f*  A,(ti)du,  so 

that,  using  standard  martingale  theory,  £(M^)2  <  0(l)E(Y^).  Also,  from 
Condition  (Al)  and  the  boundedness  of  /, 


-EO'Jr*) du  =  +  O(mun^n). 


It  follows  that 


£(M<">)2  du  =  0(nwnln). 


The  second  term  on  the  r.h.s.  of  (6.1)  is  of  order  0(B\)  +  0(B 2)  uniformly  in  <, 
where 

■Bi  =  -T==r  [  HYl?  =  0}  \Ml?\du, 

y/nwn  ln  J'DX 

b2  =  /  I -  />,  *)  (yj^r1  1  dM. 

Since  Yt2n^  =  0  implies  that  z)  =  0  for  Lebesgue  a.e.  u  E  Tt,  Condition  (A2) 

holds  with  Y(n\tyz)  replaced  by  Thus,  by  the  Cauchy-Schwarz  inequality 

and  (6.2),  B\  is  of  order 


Denote 


op(\/U)Op(\/nwnln )  =  op(l). 

r]u  =  J  7{y(n>(u,2)  >0}dv. 


Since,  for  0  <  6  <  1, 


P(  inf  r,u<8ln)  =  p(  sup  [  7{y<n>(u, z)  =  0}  dv  >  i„(l  -  5)) 

"eP*  \«€D.  / 

<  P(Leb{u  €  P2:y(n)(u,2)  =  0}  >  I„(l  -  6))  -*  0 

by  Condition  (A2),  we  see  that  rj~l  is  of  order  Op(l~l )  uniformly  in  u  €  V; .  The 
Cauchy-Schwarz  inequality  gives 


vl<Yl?  J  iY^z))-1  dv, 


which  yields  an  upper  bound  on  (Yj?))  1 ,  so  that 


i?2  <  Op 


ln  '  JV, 


\nwnln 
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By  Condition  (Al), 


E  (rrV  -/(u’*))  du  =  o(i2n). 

JVinwnln  ' 


Also,  by  (A3), 


EWr}YMMr'd^d‘  =  0&- 


Thus,  using  (6.2)  and  the  Cauchy-Schwarz  inequality  twice,  the  expectation  of  the 
integrand  in  (6.3)  is  of  order 


It  follows  that  J?2  is  of  order  op(l). 

For  u  e  ( Vz  U  X>z)c,  a0(u,z)  =  0.  By  the  Lipschitz  assumption  on  q0,  the  last 
term  on  the  r.h.s.  of  (6.1)  is  bounded  uniformly  in  t  by 

, -  f  |[ao(U,z)  +  0(/n)  +  0(u.„)]ru(?)  ,  w  ,  , -  ft,  NIJ 

Vnwn  - ~  - - - - - a0(u,  z)  du  +  ^nwn  |a0(u,  z)\du 

Jvz\  YiV  Jvz 

<  y/nwn{0(ln)  +  0(w„)  +  0(l)Leb{u  E  V2  :  =  0}}  +  0(y/nw^)Leb{T>z} 


by  (A2),  (A4),  nw\  — ►  0  and  nwnl £  — ►  0.  This  completes  the  proof.  □ 


Proof  OF  Theorem  2.2  By  routine  calculation 


V^(A-A){t,z)  =  x\t,z) 

+  V*  rf  I {u€Vx}(-^r - -L - -)Mlnz>dudx 

Jo  Jo  ^Yix}  nwnlnf(u,x )/ 

at  /  OiJJx  \ 

I{u  E  Vz]  (— -  d0(u,x)J  dudx, 

Yuz 


where 


w,  ,  1  f\,  _  ,MM(du,zA 

*(*.*)  =  -r=  £  /  €  p«}  777,  ,  \  ’ 

vn  JTj  ./o  /(u>  zi) 
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and  given  a  function  xp  on  [0,  l]2,  is  defined  to  be  the  piecewise  bilinear  approx¬ 
imation  to  xp  determined  by  the  cells  Crj,  obtained  by  extending  the  definition  of 
xp*  in  the  obvious  way. 

To  complete  the  proof  it  suffices  to  show  that  the  last  two  terms  of  (6.4)  con- 

verge  uniformly  in  probability  to  0,  and  X — >m,  where  we  are  using  Lemma  4.1  of 
McKeague  (1988)  again.  As  in  the  proof  of  Theorem  3.1  of  MU  (1990b),  we  need  to 
show  that  {A',  n  >  1}  is  tight  in  D2  and  its  finite  dimensional  distributions  converge 
weakly  to  those  of  m.  Tightness  can  be  shown  using  Jn  =  0(n),  see  Lemmas  2  and 
3  of  MU  (1990b).  Convergence  of  the  finite  dimensional  distributions  can  be  shown 
using  Condition  (Bl)  to  obtain  convergence  of  the  predictable  variation  processes 
of  increments  of  A'  (cf.  Lemma  4  of  MU  (1990b)),  and  verifying  the  Lindeberg 
condition  which  is  similar  to  (3.2)  of  MU  (1990b). 

The  second  term  on  the  r.h.s.  of  (6.4)  is  uniformly  of  order  0(G  1)  +  0(G 2), 
where 

Gl  ”  JL 7,r"> “ °* |M“l * dx‘ 

As  in  dealing  with  B\  in  the  proof  of  Theorem  2.1,  the  Condition  (B2)  holds  with 
y(n)(<,  z)  replaced  by  Y^\  By  Condition  (Bl)  and  boundedness  of  /,  we  have  that 
/ Jv  E{M^)2dudx  =  0(nwnln).  Thus,  Gi  is  of  order 


=oP(i). 

Also,  as  in  dealing  with  B2  in  the  proof  of  Theorem  2.1,  define 

r)ux=  l  I{Y(n){v,x)>0}dv. 

Since,  for  0  <  S  <  1, 

P{  inf  rjux  <  Sln)  —  p(  sup  f  I{Y^n\v,  x)  =  0}  dv  >  /n(l  -  6)] 

\(u,x)€I)Vtu  J 

<  P^  JJ  /{y^n^(w,x)  =  0}  dudx  >  xvnln(l  —  6)^ 

<  P(Leb{(u, x)  <E  V:Y^\u,x)  =  0}  >  wnln(  1  -  £))  -  0 

by  Condition  (B2),  we  have  that  rj~p  is  of  order  Op(l~l )  uniformly  in  (xi,x)  6  V- 
So  that 


g2  <  Op(^t)  jjv\^jn~ ^\(JTu  dv)\M^\dudx-  (6-5) 
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By  (Bl),  (B3)  and  Cauchy- Schwarz  equality,  the  expectation  of  the  integrand  in 
(C.5)  is  of  order 


It  follows  that  G2  is  of  order  op(l).  Thus  the  second  term  converges  uniformly  in 
probability  to  zero. 

Consider  the  last  term  of  (6.4).  For  the  piecewise  constant  model,  al’i  = 
Thus,  the  last  term  of  (6.4)  is  of  order 


0(v^)Leb2{(«,*):  Yj:5  =  0}-^0 


by  Condition  (B2)  and  nu;2/2  =  0(1).  This  completes  the  proof.  □ 


PROOF  of  Proposition  2.1  We  only  consider  part  (i);  the  proof  of  part  (ii)  is 
almost  identical.  Under  the  conditions  of  the  proposition,  Y^n\t^z)  has  a  binomial 
distribution  with  parameters  n  and  Jz  f(t,x)dx ,  and  there  exist  positive  constants 
b  and  c  such  that  bwn  <  fz  f(t,x)dx  <  cwn,  for  each  ( t,z )  E  supp(ao).  Since  the 
fourth  central  moment  of  a  binomial  (n,p)  r.v.  is  of  order  0((np)2),  and  using  the 
Lipschitz  assumption  on  /,  as  well  as  iudJ  =  0(1)  and  nwnln  — ►  00,  we  have 


E 


Y<<n\t,z) 


nw 


Tl 


=  0 


V  (™*)4 ) 


0(w\)  =  o{ll) 


uniformly  over  (t,z)  €  supp(ao).  Condition  (Al)  follows  using  the  dominated 
convergence  theorem.  Next, 

£[Leb{t  €  Vz-.Y^n\t,z)  =  0}]  <  (1  -  bwn)n  <  e"1"1'""  =  o{ln) 


bv  nwnln  — >  oo,  giving  Condition  (A2).  Condition  (A3)  holds  by  Lemma  2(i)  of 
MU  (1990a).  □ 


It  is  useful  to  define  the  grouped  data  version  of  given  by 
Slk)(P, t)  =  2-  £  zkY^e *•>  for  t  €  Tr. 

Note  that,  by  Conditions  (Cl.l)  and  continuity  of  in  (C2),  if  wn  — ►  0  and 
ln  — *  0,  then  satisfies  Condition  (Cl.l)  in  place  of  S^k\ 
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Proof  of  Theorem  3.1  Define  X(@)  =  n~1(C(^)  -  C(/90)),  and  Xg(/3)  analo¬ 
gously  for  the  grouped  data  case.  Consider  the  difference  between  X(fl)  and  Xg(j3), 
for  some  fixed  /?.  Routine  manipulation  gives 


Ay/?)  -  A(/?)|  <i|  f  (0  -  00 ){*i  -  Zi(u))l{Zi(u)  €  lj}dNi(u) 

U  r,j,ijTr 


+ 


S{s0)(P,u)' 

o  '^W#)(0b,«). 


H\'°s 


-log 


>(M' 
SW(0 0,«) 


dJV<n>(u). 


The  first  term  on  the  r.h.s.  is  bounded  above  by 

10  -  00 1  sup  |(Zj  -  Zi(u))I{2i(u)  €  lj}\  • 


since  the  width  of  J7-  is  tun  — v  0  and  n^1  N^n\l)  —  Op(l),  see  AG  (p.  1108). 
Similarly,  the  second  term  tends  in  probability  to  zero  by  continuity  of  log,  the 
remark  preceding  the  proof,  and  the  assumption  that  is  bounded  away  from 

zero.  Thus  \Xg(j3)  —  X(f3)\ — >0.  The  result  now  follows  using  the  argument  in 
Section  2.3  of  AG.  □ 


PROOF  OF  Theorem  3.2  Using  a  Taylor  expansion  of  Ug(f3)  around  /30,  and  in¬ 
specting  the  proof  in  the  continuous  data  case  (AG,  p.  1106),  we  reduce  to  showing 


that 


1 

y/n 


U9(M  -  ^(0o)|— >o 


and  —  sup  Ig(ft)  —  /(/?) 
n  Pes 


>0, 


where  U(/3)  is  the  derivative  of  C(/3 ),  and  /(/?)  is  minus  the  second  derivative  of 
C(/?).  Now,  since 


%)N- 

mao)  =  £  /  5(-)«W  -  /  (»). 


where  M ^  =  Yli  Mi,  we  have 


1 


y/n 


u,m-u{h) 


(*i  -  Zi(u))I{Zi(u)  e  Ij)  dM,(u) 


(6.6) 
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+ 


+ 


1 

\fn 

1 

y/n 


f1  (s^u)  _  s^u)\  (n) 

Jo  ($«»(/*.,«)  s'0)(/?o,u)J  {  ) 

[  (j2^oZi(u)i{zM  ei^Y.iu) 

-"^y^SW(p0,u))\0(u)du 
Sg  (poi / 


(6.7) 


(6.8) 


The  integrands  in  (6.6)  are  predictable  and  bounded  in  modulus  by  wn,  so  that, 
using  standard  martingale  theory,  (6.6)  is  of  order  Op(wn). 

Next,  (6.7)  is  bounded  by 


l_  ['  (s™{p0,u) 
\fn  Jo  o,«) 


S^\Pq,u)\ 

*“>(/*>, 


dM{n\u) 


_1_  M(1)(^0:*r) 

%/”  ~  ls(°)(/30,tr) 


S?\fa,tr) 

S(9°\po,tr) 


|  M'n) 


where  Mr"^  is  the  increment  of  over  7)-  and  tr  is  an  arbitrary  point  in  Tr. 

The  integrand  of  the  first  term  is  predictable  and  of  order  op(l)  uniformly  in  u,  by 
Conditions  (Cl.l)  and  (C2),  so,  as  in  dealing  with  (6.6),  the  first  term  above  con¬ 
verges  in  prob ability  to  zero.  To  deal  with  the  second  term,  approximate  Sg^/Sg0^ 
by  S^/S^°\  with  an  error  of  order  Op(wn)  uniformly  in  t.  This  leads  to  the  upper 
bound 


1  Lr 


-=Op(l)  £  £  IS^OWr)  -  5(fc)(A,tr)||^")| 

V  k=0 r=l 

+op(u>„)-hy;iMW|. 


(6.9) 

(6.10) 


Note  that  the  second  moment  of  is  of  order  0(nln  )  uniformly  in  r,  so,  by  the 

Cauchy-Schwarz  inequality  and  Condition  (Cl. 2),  we  have 

E[\^k\po,tr)  -  5(t>(/Mr)||M<">|)  <  0(»r1)1'aO(nJn)1/a  =  0(/n)1/2, 
uniformly  in  r,  for  k  =  0,1.  Thus  (6.9)  is  of  order 


n~1^2Op(l)LnO(ln)1^  =  Op(nln)-''2-^  0 
since  nln  — >  oo.  The  term  (6.10)  is  of  order 

Op(w„)n~1/2LnO(nln)1/2  =  Op(uiJ/l„)1/J-^  0, 
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since  niv^  — ►  0  and  nln  — >  oo.  We  have  shown  that  (6.7)  converges  in  probability 


to  zero. 


Now  consider  the  lase  term  (6.8).  Approximate  the  first  term  of  the  integrand 
by  nS^\  with  an  error  of  order  0(nwn)  uniformly  in  u.  Then,  also  approximating 

S^V40)  by  S^/Sw  ,  we  obtain  an  upper  bound  of  order 

Op(\/nwn)  +  \/n|  (Sw(/30,u)  -  ^j^^5(0)(^o,u))Ao(u)<iu  . 

By  the  Lipschitz  condition  on  Aq,  the  second  term  is  of  order  Op(^Jnln).  Since 
niv^  — ►  0  and  nl„  — ►  0,  it  follows  that  (6.8)  tends  in  probability  to  zero. 

Finally,  consider  the  difference  between  Ig(0)  and  1(0).  Note  that 

m  =  L  i  rN  (u)’ 

and  Ig(0)  is  obtained  by  replacing  5 ^  by  throughout  this  expression.  By  the 
remark  preceding  the  proof  of  Theorem  2.1,  the  difference  between  the  integrands 
in  Ig(0)  and  1(0)  tends  uniformly  in  probability  to  0.  Since  iV(n)(l)  =  Op(n),  it 

follows  that  n-1  sup^  | Ig(0)  —  1(0) | — >0,  completing  the  proof.  □ 

Proof  of  Theorem  3.3  We  shall  use  the  notation  N9(u)  =  N ^  for  u  €  Tr. 
Note  that 


v^(A(t)  -  Ao(<))  =-d-  /*(- — b - — b - )jV»(u)du 

"  V^lnJo  XS?\f),u)  Sl°K0,u)f  v 


+  X*(t)  +  yft{[  - 


N!(u) 


(6.11) 


0  nlnS(0)(Pc,u) 


du  —  Ar 


where 


Consider  the  first  term  on  the  right  hand  side  of  the  decomposition  (6.11).  We  may 
approximate  Sg°\0,u)  by  §(°\0,u)  with  an  error  of  order  0(uin)  uniformly  in  u. 
The  second  moment  of  is  of  order  of  0(riln)  uniformly  in  r,  so  that  N is  of 
order  of  Op(nln)  uniformly  in  r  by  nln  — >  oo.  Thus  the  first  term  is  of  order 

Op{wn)—^=—Op{nln)  =  Op(v/nu>n)-bo 

V^‘n 


uniformly  in  t  and  0. 
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As  in  AG,  denote 


H(6  t)  =  --  f  _  <jj\Kn)(tt) 


and  let  be  the  grouped  version  of  H  obtained  by  replacing  by  S^k\  k  =  0, 1. 
Taylor’s  expansion  gives 


where  /?i  lies  between  P  and  po.  From  the  proof  of  Theorem  3.4  of  AG  (p.1109), 
converges  in  probability  to  — E  *  uniformly  in  ty  for  any  Pi  such  that 

Pi — >fio.  The  difference  between  Hg(/3,t)  and  H(p,t)  is  of  order  op(l)  uniformly 
in  P  and  t.  Thus,  Hg(pi,t)  also  converges  in  probability  to  — E  •  t )  and  it  follows 
from  the  proof  of  Theorem  3.2  and  the  proof  of  Theorem  3.2  of  AG  (p.  1106)  that 

X — ►mi (!)?/>.  We  have  used  the  conditions  nw\  — *  0,  nl \  — ►  0  and  nln  — >  oo  in 
order  to  be  able  to  appeal  to  Theorem  3.2. 

Since  A0  is  Lipschitz  and  nl\  — »  0,  as  in  dealing  with  (6.7)  in  the  proof  of 
Theorem  3.2,  the  last  term  on  the  right  hand  side  of  the  decomposition  of  (6.11) 
differs  from  the  piecewise  linear  approximation  to 


M0(t)  = 


1  7*  dM<<n\u) 

V™  Jo  s(°\p0,u) 


D 

by  at  most  op(l),  uniformly  in  t.  From  the  proof  of  Theorem  3.4  of  AG,  Mo — >mo 
and,  also  using  the  proof  of  Theorem  3.2,  Mo  and  X  are  seen  to  be  asymptotically 
independent.  Thus  (Mo, AT)  converges  in  distribution  to  (mo,mi(l)^),  and,  by 
applying  a  slightly  extended  version  of  Lemma  4.1  of  McKeague  (1988),  so  does 
(M0*,X*).  This  completes  the  proof.  □ 


PROOF  of  Theorem  4.1  The  proof  is  almost  identical  to  the  proof  of  Theorem 
4.1  of  MU  (1991),  so  we  only  sketch  the  required  modifications.  In  (6.8)  of  that 
paper,  M t  replaces  M,  Mq  replaces  Mo,  and  we  appeal  to  Lemma  4.1  of  McKeague 
(1988)  to  show  that  these  substitutions  make  no  difference  asymptotically.  We  also 
need  to  show  that 

s('k\pQlu)  =  f  xkeP°xf(u,  x)  dx  for  k  =  0,1,2,  (6.12) 

Jo 


as  is  immediate  in  the  i.i.d.  case.  Since  x  i— ►  xke&oX  is  Lipschitz  on  bounded  intervals, 

[  S^k\pQ,u)  -  (  xke^oX f(u,x)dx  du  =  op(\/wnln)  +  Op(wn) 

Jo  Jo 
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by  Condition  (Bl).  Thus  the  representation  (6.12)  follows  by  the  triangle  inequality, 
Condition  (Cl.l),  and  continuity  of  s^k\ 

Finally,  note  that  under  the  piecewise  constant  model  the  remainder  terms 
(6.8),  (6.10)  and  the  first  term  in  (6.11)  are  absent,  so  Theorems  3.2  and  3.3  hold 
without  the  condition  nw„  — >  0.  □ 
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Table  1.  Observed  Coverage  Probabilities  of  Asymptotic  95%  Confidence  Intervals 
for  $>. 


Cens 

Ln 

Jn 

Sample  Size 

250 

500 

1000 

5000 

10000 

50000 

5 

5 

.9490 

.9494 

.9492 

.9496 

.9506 

.9438 

10 

5 

.9490 

.9488 

.9496 

.9496 

.9504 

.9456 

20 

5 

.9488 

.9490 

.9498 

.9494 

.9502 

.9450 

31% 

5 

10 

.9510 

.9478 

.9466 

.9508 

.9482 

.9480 

10 

10 

.9510 

.9478 

.9476 

.9508 

.9490 

.9484 

5 

20 

.9492 

.9464 

.9494 

.9504 

.9508 

.9478 

20 

20 

.9488 

.9458 

.9494 

.9502 

.9512 

.9474 

5 

5 

.9516 

.9490 

.9512 

.9428 

.9460 

.9526 

10 

5 

.9524 

.9494 

.9514 

.9428 

.9452 

,9528 

20 

5 

.9520 

.9492 

.9516 

.9428 

.9452 

.9526 

60% 

5 

10 

.9518 

.9460 

.9484 

.9454 

.9476 

.9530 

10 

5 

.9528 

.9466 

.9480 

.9454 

.9480 

.9530 

5 

20 

.9536 

.9466 

.9502 

.9460 

.9468 

.9522 

20 

20 

.9524 

.9462 

,9506 

.9452 

.9470 

.9526 

Table  2.  Observed  Probabilites  of  Rejecting  the  Cox  Model  at  Asymptotic  Level 
5%. 


Cens 

K 

Jn 

Sample  Size 

250 

500 

1000 

5000 

10000 

50000 

5 

5 

.0806 

.0556 

.0608 

.0568 

.0516 

.0472 

10 

5 

.0732 

.0596 

.0576 

,0576 

.0520 

.0472 

20 

5 

.0750 

.0584 

.0594 

.0570 

,0484 

.0480 

31% 

5 

10 

.2160 

.1480 

.1180 

.0704 

.0608 

.0510 

10 

10 

.2298 

.1542 

.1146 

.0800 

.0658 

.0564 

5 

20 

.6534 

.4290 

.3236 

.1744 

.1276 

.0668 

20 

20 

.7364 

.4702 

.3326 

.1836 

.1368 

.0772 

5 

5 

.0702 

.0580 

.0634 

.0592 

.0526 

.0520 

10 

5 

.0644 

.0580 

.0548 

.0576 

.0504 

.0494 

20 

5 

.0644 

.0582 

.0576 

.0600 

.0490 

.0478 

60% 

5 

10 

.1592 

.1176 

.1006 

.0796 

.0682 

.0572 

10 

10 

.1600 

.1114 

.0964 

.0800 

.0668 

.0578 

5 

20 

.4662 

.2830 

.2196 

.1576 

.1308 

.0798 

20 

20 

.5306 

.3056 

.2222 

.1622 

.1348 

.0888 

NOTES:  The  data  were  generated  using  the  uniform  random  number  generator  of 
Marsaglia,  Zaman  and  Tsang  (1990).  The  number  of  samples  in  each  run  was  5000. 


UNCLASSIFIED 


graararararcM 


MASTER  COPY 


FOR  REPRODUCTION’  PURPOSES 


U.  REPORT  SECURITY  CLASSIFICATION 
Unrl  ?<?ci  fi'  pH 


2a.  SECURITY  CLASSIFICATION  AUTHORITY 


2b.  DECLASSIFICATION /DOWNGRADING  SCHEDUL 


4,  PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 

FSU  Technical  Report  No.  M-840 


6a.  NAME  OF  PERFORMING  ORGANIZATION  6b.  OFFICE  SYMBOL 
i-.-,  ,  _  ...  .  (If  applicable) 

Florida  State  University 


6c  ADDRESS  ( Gty ,  State,  and  ZIP  Code) 

Department  of  Statistics 

Tallahassee,  FL  32306-3033 


REPORT  DOCUMENTATION  PAGE 


b.  RESTRICTIVE  MARKINGS 


6a.  NAME  OF  FUNDING /SPONSORING 
ORGANIZATION 

U".  S.  Army  Research  Office 


8c  ADDRESS  (C/ty,  State,  and  ZIP  Code) 

P.  0.  Box  12211 


8b.  OFFICE  SYMBOL 
(If  applicable) 


3.  DISTRIBUTION /AVAILABILITY  OF  REPORT 

Approved  for  public  release; 
distribution  unlimited. 


5.  MONITORING  ORGANIZATION  REPORT  NUMBER(S) 

USARO  Technical  Report  No.  D  —  1 1  7 


7a.  NAME  OF  MONITORING  ORGANIZATION 

U.  S.  Army  Research  Office 


7b.  ADDRESS  (Crty,  State,  and  ZIP  Code) 

P.  0.  Box  12211 

Research  Triangle  Park,  NC  27709-2211 


9.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 


10.  SOURCE  OF  FUNDING  NUMBERS 


PROGRAM 

PROJECT 

TASK 

WORK  UNIT 

Park,  NC  27709-2211 

ELEMENT  NO. 

NO. 

NO* 

* 

ACCESSION  NO. 

11.  TITLE  (Include  Security  Gasification) 

On.  the  analysis  of  grouped  survival  data  using  cumulative  occurrence/ 
exposure  rates 


12.  PERSONAL  AUTHOR{$} 

Ian  W.  McKeague  and  Mei-Jie  Zhang 


13b.  TIME  COVERED 
FROM  TO _ 


13a.  TYPE  OF  REPORT 

Technical 


14.  DATE  OF  REPORT  (Year,  Month,  Day)  h*.  PAGE  COUNT 

March  1991  '  25 


16.  SUPPLEMENTARY  NOTATION 


The  view,  opinions  and/or  findings  contained  in  this  report  are  those 


17. 

COS  ATI  CODES  1 

FIELD 

GROUP 

SUB-GROUP  1 

18.  SUBJECT  TERMS  {Com/nue  on  reverse  if  necessary  and  identify  by  block  number) 
Counting  processes,  conditional  hazard  function, 
grouped  survival  data,  martingale  central  limit 
theorem,  proportional  hazards  model,  nonlinear 


19.  ABSTRACT  (Continue  on  reverse  if  necessary  and  identify  by  biock  number)  regression  . 


See  Back 


20.  DISTRIBUTION /AVAILABILITY  OF  ABSTRACT  21.  ABSTRACT  SECURITY  CLASSIFICATION 

□  unclassified/unlimited  E3  SAME  AS  RPT.  □  dtic  users  Unclassified 


22a.  NAME  OF  RESPONSIBLE  INDIVIDUAL  22b.  TELEPHONE  (Include  Area  Code)  I  22c.  OFFICE  SYMBOL 


DO  FORM  1473, 84  MAR 


83  APR  edition  may  be  used  until  exhausted. 
All  other  editions  are  obsolete. 


SECURITY  CLASSIFICATION  OF  THIS  PAGE 

UNCLASSIFIED 


IFIED 


*KCU*1TY  CUAMiriCATJON  Or  THI»  P*OK 


SECURITY  classi  Ft  cation  OF  THIS  PACE 


